13. Minimizing Error Functions
INSTRUCTOR NOTE: From 2:22 onward, the slide title should say "Mean Absolute Error".
Development of the derivative of the error function
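Notice that we've defined the squared error to be
$$Error = \frac{1}{2}(y - \hat{y})^2$$
Also, we've defined the prediction to be
$$\hat{y} = w_1 x + w_2$$
So to calculate the derivative of the Error with respect to $w_1$, we simply use the chain rule:
$$\frac{\partial Error}{\partial w_1} = \frac{\partial Error}{\partial \hat{y}} \cdot \frac{\partial \hat{y}}{\partial w_1}$$
The first factor on the right-hand side is the derivative of the Error with respect to the prediction $\hat{y}$, which is $-(y - \hat{y})$.
The second factor is the derivative of the prediction with respect to $w_1$, which is simply $x$.
Therefore, the derivative is
$$\frac{\partial Error}{\partial w_1} = -(y - \hat{y})\,x$$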
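
The chain-rule result can be sanity-checked numerically. The sketch below is only an illustration: it assumes the squared error is 0.5 * (y - y_hat)**2 and the prediction is y_hat = w1 * x + w2, and the function names and sample values are hypothetical, not part of the lesson. It compares the analytic derivative -(y - y_hat) * x against a central finite difference.

```python
# Assumed definitions (hypothetical names, chosen for illustration only):
#   prediction:    y_hat = w1 * x + w2
#   squared error: 0.5 * (y - y_hat) ** 2

def prediction(w1, w2, x):
    return w1 * x + w2

def squared_error(w1, w2, x, y):
    return 0.5 * (y - prediction(w1, w2, x)) ** 2

def grad_w1(w1, w2, x, y):
    # Chain rule: dError/dy_hat = -(y - y_hat), dy_hat/dw1 = x
    return -(y - prediction(w1, w2, x)) * x

def numeric_grad_w1(w1, w2, x, y, h=1e-6):
    # Central finite difference in w1, holding w2 fixed
    return (squared_error(w1 + h, w2, x, y)
            - squared_error(w1 - h, w2, x, y)) / (2 * h)

# Arbitrary sample values, not taken from the lesson
w1, w2, x, y = 0.5, -1.0, 2.0, 3.0
analytic = grad_w1(w1, w2, x, y)
numeric = numeric_grad_w1(w1, w2, x, y)
print(analytic, numeric)
```

With these sample values the prediction is 0, so the analytic derivative is -(3 - 0) * 2 = -6, and the finite-difference estimate should agree to within floating-point error.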

Exercise
Calculate the derivative of the Error with respect to $w_2$ and verify that it is precisely $-(y - \hat{y})$.
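
One way to check an answer to this exercise is to compare it with a numerical derivative. The sketch below assumes the same definitions (squared error 0.5 * (y - y_hat)**2, prediction y_hat = w1 * x + w2); the names and data points are made up for illustration. It measures the largest gap between the candidate formula -(y - y_hat) and a finite-difference estimate over a few points.

```python
# Assumed definitions, as in the derivation: y_hat = w1 * x + w2,
# Error = 0.5 * (y - y_hat) ** 2. All names here are hypothetical.

def squared_error(w1, w2, x, y):
    y_hat = w1 * x + w2
    return 0.5 * (y - y_hat) ** 2

def candidate_grad_w2(w1, w2, x, y):
    # Candidate answer: dy_hat/dw2 = 1, so the derivative is -(y - y_hat)
    return -(y - (w1 * x + w2))

def numeric_grad_w2(w1, w2, x, y, h=1e-6):
    # Central finite difference in w2, holding w1 fixed
    return (squared_error(w1, w2 + h, x, y)
            - squared_error(w1, w2 - h, x, y)) / (2 * h)

# Made-up (x, y) points and weights, used only to exercise the check
points = [(1.0, 2.0), (2.0, 3.5), (-1.0, 0.5)]
w1, w2 = 0.8, 0.3

max_gap = max(
    abs(candidate_grad_w2(w1, w2, x, y) - numeric_grad_w2(w1, w2, x, y))
    for x, y in points
)
print(max_gap)
```

A gap close to zero at every point suggests the candidate formula matches the true derivative; a large gap would indicate a sign or factor error.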